Interpreting Blackbox Models via Model Extraction

Authors

Osbert Bastani

Carolyn Kim

Hamsa Bastani

Abstract

The ability to interpret machine learning models has become incredibly important as machine learning is increasingly used to inform consequential decisions. We propose an approach to interpreting complex, blackbox models by constructing global explanations that summarize their reasoning process. In our approach, a global explanation is a decision tree that approximates the blackbox model. As long as the decision tree is a good approximation, its reasoning process mirrors that of the blackbox model. We devise a novel algorithm for extracting decision tree explanations that actively samples new training points to avoid overfitting. We evaluate our algorithm on a random forest that predicts diabetes risk and on a learned control policy for the cart-pole problem. Compared to several baselines, the decision trees extracted by our algorithm are substantially more accurate and, based on a user study, equally or more interpretable. Finally, we describe several insights derived from our interpretations, including a causal issue that we validated with a physician.
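The idea in the abstract, fitting a decision tree to a blackbox model's predictions while drawing fresh training points as the tree grows, can be illustrated with a small sketch. This is not the authors' algorithm: the `blackbox` function is a hypothetical stand-in for a complex model, and drawing new points uniformly inside each node's region is an assumption made here for simplicity. All function names below are illustrative.

```python
import random

def blackbox(x):
    # Hypothetical stand-in for a complex model: label 1 inside a rectangle.
    return 1 if (x[0] > 0.5 and x[1] > 0.3) else 0

def sample_box(box, n, rng):
    # Draw n points uniformly from an axis-aligned box [(lo, hi), ...].
    # Uniform sampling is an assumption of this sketch.
    return [[rng.uniform(lo, hi) for lo, hi in box] for _ in range(n)]

def gini(labels):
    # Gini impurity of a list of 0/1 labels.
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(points, labels):
    # Return (weighted impurity, feature, threshold) of the best split, or None.
    best, n = None, len(points)
    for f in range(len(points[0])):
        order = sorted(range(n), key=lambda i: points[i][f])
        for k in range(1, n):
            a, b = points[order[k - 1]][f], points[order[k]][f]
            if a == b:
                continue
            left = [labels[i] for i in order[:k]]
            right = [labels[i] for i in order[k:]]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[0]:
                best = (score, f, (a + b) / 2)
    return best

def extract_tree(box, depth, n, rng):
    # Query the blackbox on fresh points drawn inside this node's region,
    # so deep nodes are not starved of training data.
    points = sample_box(box, n, rng)
    labels = [blackbox(p) for p in points]
    majority = 1 if 2 * sum(labels) >= len(labels) else 0
    if depth == 0 or gini(labels) == 0.0:
        return majority
    split = best_split(points, labels)
    if split is None:
        return majority
    _, f, t = split
    left_box, right_box = list(box), list(box)
    left_box[f] = (box[f][0], t)
    right_box[f] = (t, box[f][1])
    return (f, t,
            extract_tree(left_box, depth - 1, n, rng),
            extract_tree(right_box, depth - 1, n, rng))

def predict(tree, x):
    # Internal nodes are (feature, threshold, left, right); leaves are labels.
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree

def fidelity(tree, box, n, rng):
    # Fraction of fresh points on which the tree agrees with the blackbox.
    pts = sample_box(box, n, rng)
    return sum(predict(tree, p) == blackbox(p) for p in pts) / n

rng = random.Random(0)
domain = [(0.0, 1.0), (0.0, 1.0)]
tree = extract_tree(domain, depth=3, n=200, rng=rng)
score = fidelity(tree, domain, 2000, rng)
print(tree, score)
```

The fidelity score measures agreement with the blackbox rather than with ground-truth labels, which is the sense in which the extracted tree is "a good approximation" in the abstract.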

Related resources

Interpretability via Model Extraction

The ability to interpret machine learning models has become increasingly important now that machine learning is used to inform consequential decisions. We propose an approach called model extraction for interpreting complex, blackbox models. Our approach approximates the complex model using a much more interpretable model; as long as the approximation quality is good, then statistical properties...

Long-term Iran's inflation analysis using varying coefficient model

Varying coefficient models are among the most important tools for discovering dynamic patterns when a fixed pattern does not fit the data adequately because of diverse temporal or local patterns. These models are natural extensions of classical parametric models and have achieved great popularity in data analysis thanks to their good interpretability. The high flexibility and interpretab...

Bayesian Analysis of Spatial Probit Models in Wheat Waste Management Adoption

The purpose of this study was to identify factors influencing the adoption of wheat waste management by wheat farmers. The study used spatial probit models, estimated with a Bayesian approach. MATLAB software was used for the analysis. Data from 220 wheat farmers in Khouzestan Province were collected by random sampling in winter 2016. To calculate Bay...

Towards blackbox identity testing of log-variate circuits

Derandomization of blackbox identity testing reduces to extremely special circuit models. After a line of work, it is known that focusing on circuits with constant depth and constantly many variables is enough (Agrawal, Ghosh, Saxena, STOC'18) to get to general hitting-sets and circuit lower bounds. This inspires us to study circuits with few variables, e.g. logarithmic in the size s. We give the fi...

Extraction Kinetics and Physicochemical Studies of Terminalia catappa L Kernel Oil Utilization Potential

Kinetics and selected variables (temperature, particle size and time) for extraction of Terminalia Catappa L Kernel Oil (TCKO) were investigated using solvent extraction. Kinetic models studied were: parabolic diffusion, power law, hyperbolic, Elovich and pseudo-second-order. In ascending order, the best-fitted models at the optimum temperature and oil yield were Elovich’s model, hyperbolic...

Journal:

CoRR

Volume abs/1705.08504

Pages -

Publication date 2017
